job

Middle AI Engineer (AI Agent)

Job Description
  • Fine-tune and optimize existing open-source LLMs (e.g., Code LLaMA, StarCoder) for code related tasks (completion, explanation, refactoring, bug fixing).
  • Process and manage large-scale, multi-language code datasets (cleaning, deduplication, quality filtering) using distributed frameworks.
  • Implement Retrieval-Augmented Generation (RAG) pipelines with vector databases for code and technical document retrieval.
  • Optimize model inference performance (quantization, distillation, GPU acceleration) for low latency servings.
  • Develop APIs/microservices to integrate AI features into IDE extensions (VS Code API, TypeScript).
  • Monitor and evaluate model performance using code-specific metrics .
  • Collaborate with Data Engineers, Product, and DevOps to ensure high-quality data, scalable deployment, and stable production performance.
  • Write clean, maintainable code, manage version control (Git), and document experiments and deployment processes.
Job Requirements

Mandatory

  • Bachelor’s degree in Computer Science, AI/ML, Data Engineering, or related field.
  • 2–5 years of experience in AI/ML, with at least 1 year working on LLM-driven systems or AI Agents.
  • Proficiency in Python and deep learning frameworks (PyTorch or TensorFlow).
  • Hands-on experience with LLM orchestration frameworks (LangChain, LlamaIndex, Semantic Kernel).
  • Familiarity with vector databases (Pinecone, Weaviate, FAISS) and RAG architectures.
  • Experience with Docker, GitLab CI/CD (or similar), and cloud AI/ML services (AWS, GCP, Azure).
  • Understanding of distributed or big data processing tools (Spark, Dask, Ray).
  • Good English communication skills and ability to work in cross-functional teams.

 

Nice to have:

  • Experience with multi-agent systems and coordination strategies.
  • Familiarity with TypeScript and building APIs/IDE integrations for AI-powered tools.
  • Knowledge of advanced embeddings (AST parsing, syntax-aware embeddings, knowledge graphs).
  • Experience with MLOps/AgentOps tools (MLflow, W&B, Guardrails, Kubeflow).
  • Contributions to open-source projects in AI Agents, LLMOps, or coding assistants.
How To Apply

Step 1: Submit Your CV
Apply directly through the Innotech Vietnam Corporation website by uploading your updated CV to the careers section.

Step 2: Phone Screening
If shortlisted, you will receive a call from the HR team for a brief survey and initial screening.

Step 3: Interview
Qualified candidates will be invited to an interview to further assess suitability for the role.

Step 4: Onboarding
Successful applicants will receive an offer and begin the onboarding process to officially join the team.

SEND US YOUR CV

Level: Middle

Expiring date: 30/09/2025

Benefits:

Location: Di An, Binh Duong (near HCMC National Univeristy)

What We Offer

• 13th-month salary calculated based on actual working time at INNOTECH.
• PVI Healthcare Insurance for all employees
• PVI Healthcare Insurance for family
• Holiday celebrations
• Quarterly/project kickoff team-building budget.
• Monthly birthday parties with cakes and celebrations
• Laptop and monitor provided for work.
• Performance bonus plan.
• Employee referral bonus: 2,000,000 – 10,000,000 VND (depending on level/role).
• Annual company trips / Football club / Badminton club / Year-end party.
• Learning and certification support.
• Value-oriented, international working environment with a flexible culture.

Become a member of us









    Smaller than 2 MB

    You know this job via?

    FacebookAcquaintanceGoogleOthers

    I AGREE WITH INNOTECH VIETNAM TERM AND PRIVACY